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DETAILED ACTION 
Remarks 

1 . Upon careful reconsideration and the discovery of new evidence, Examiner has 
determined that previous claims 9 and 40, and current claims 62 and 64, do not contain allowable 
subject matter. In the previous Office action (dated 04/07/04), Examiner indicated that the 
limitations of claims 9 and 40 would be allowable if incorporated into independent form and that 
claims 63 and 64 were allowable. All four claims were held to contain allowable subject matter 
in that the prior art did not disclose the case wherein the group of viewers used to determine the 
areas of interest is a statistically representative subset of an intended audience, and the subset of 
the intended audience is utilized for the purposes of predicting areas of interest for the intended 
audience. 

U.S. Patent 4,755,045 by Borah et al. teaches this feature, and the combination of the 
Jacquin and Borah references renders current claims 1, 20, 32, 51, 63, and 64 obvious, as 
explained below. 

Claim Rejections - 35 USC § 103 

2. The text of those sections of Title 35, U.S. Code not included in this action can be found 
in a prior Office action. 

3. Claims 1,2,6,8,9-12,15-17,19,20,22-24,26,28,32,33,37,39-43,46-48,50,51,53- 
55,57,59,63, and 64 are rejected under 35 U.S.C. 103(a) as being unpatentable over U.S. Patent 
5,764,803 by Jacquin et al. ("Jacquin") in view of U.S. Patent 4,755,045 by Borah et al. 
("Borah"). 
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Regarding claim 1, Jacquin discloses a method of digital image compression (figure 1), 
comprising: 

identifying a plurality of areas of interest in at least a subset of digital images in a 
sequence of related images (column 3, lines 2-25: areas of interest in a current video frame are 
identified by the assignation of ellipses via the ellipse identifier 30); 

encoding the identified areas of interest at a first quality level and unidentified ares of the 
image at a second and lower quality level than the identified areas in order to produce a single 
compressed copy of each image which can be decoded at a decoder (column 3, lines 12-25 and 
first and second coders 32 and 34: areas of interest are coded with high quality; all other areas 
are coded with low quality; each image is then concatenated into a coded video signal which can 
be decoded). 

Jacquin does not disclose that the areas of interest are identified by a group of viewers, 
and that the group of viewers comprises a statistically representative subset of an intended 
audience in order to predict areas of interest of the intended audience. 

Rather, as shown in figure 1, Jacquin' s system automatically extracts objects of interest 
through the use of motion estimation, edge extraction, and frame differencing. Once the objects 
of interest are so extracted, the ellipse identifier 30 assigns ellipses of varying sizes to objects 
that are to be tracked. Essentially, Jacquin' s method automatically extracts areas of interest in a 
sequence of images rather than allowing a group of viewers to identify the areas of interest. 

Borah discloses a method for extracting and displaying the visual response of a group of 
viewers when presented a video. The concept advanced in Borah is that, when a group of viewers 
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is shown a video sequence, their eye gazes are monitored and recorded, thereby identifying areas 
of interest that represents the "looking behavior" of the viewers (see column 2, line 63 though 
column 3, line 56, and column 4, lines 9-21). The purpose is to identify areas and objects of the 
video that generate high interest to the group of viewers (column 5, lines 57-68), in contrast to 
Jacquin' s system, which relies on computer vision techniques to determine the areas of highest 
interest according to parameters such as the motion, shape, and cohesiveness of objects. 

Also, Borah's method contemplates monitoring the gaze points of a subset of an intended 
audience in order to predict areas of interest from the intended audience (column 7, lines 7-19). 
For example, the gaze points of a group of males are generated such as shown in figure 6. The 
response of the small group of males provides a prediction of how a different or larger group of 
males would respond to the same stimuli. The goal of this type of analysis is to determine the 
"commercial feasibility" of visual or audiovisual presentations for effectiveness in 
communicating information to a given group of viewers (column 6, lines 25-35). The analysis 
serves as a sort of pre-screening process by which advertisers and the like may determine the 
effectiveness of generating physiological interest of particular objects in a video. 

Borah's system of monitoring the gaze point of a group of viewers essentially provides a 
method by which objects of interest in a video sequence are identified. It would have been 
obvious at the time of the invention to modify Jacquin by Borah to identify the areas of interest 
by a group of viewers that comprise a statistically representative subset of an intended audience 
to predict areas of interest for that audience, rather then automatic methods disclosed by Jacquin, 
since Borah teaches that extracting areas of interest according to a group of viewers provides 
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areas of interest that are physiologically important to the viewers and that mimic the looking 
behavior of the viewers. 

Regarding claim 2, Jacquin discloses creating a quantization map based on the identified 
areas of interest (figure 1: first coder 32 creates a quantization mapping for the areas of interest), 
wherein: 

the encoding is performed based on the quantization map (figure 1 : the "coded video 
signal" is based on the quantization map created by the first coder 32). 

Regarding claim 6, Borah discloses tracking the gaze points as claimed (column 8, lines 

30-37). 

Regarding claim 8, Jaquin discloses extrapolating areas of interest for a remainder of 
images in the sequence from the identified areas of interest in said subset (Borah, column 4, lines 
61-65: areas of interest are calculated for predetermined intervals of frames, and Jacquin, column 
7, lines 51-61: the areas of interest (ellipses) of the current frame are used to predict, or create 
extrapolated versions of, areas of interest in a successive frame; the predicted (extrapolated) 
areas of interest are then used to positively locate the actual areas of interest). 

Regarding claim 9, Jacquin' s sequence of related images is a digital motion picture. 

Regarding claim 10, Borah discloses a visual histogram to determine the most popular 
areas according to the location of fixation points (see figures 1, 2, and 6). 

Regarding claim 11, Borah's areas of interest are identified in real time during a live 
transmission of the image to a group of viewers. 

Regarding claim 12, Jacquin' s digital image spatially represents the image to be encoded. 
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Regarding claim 15, Jacquin incorporates the use of DCTs (column 10, lines 9-13). 

Regarding claim 16, Jacquin discloses the quality level for blocks of pixels is adjusted for 
the areas of interest through the use of a quantization scale factor encoded for each block of 
pixels (figure 1 : first coder 32 utilizes an adjusted "fine" quantization scale factor). 

Regarding claims 17 and 19, Jacquin discloses adjusting the quality level of the 
unidentified areas downward using one of the claimed methods (figure 1: second coder 34 
adjusts the quality level downward by quantizing the coefficients, thereby frequency-filtering the 
image). 

Regarding claim 20, Jacquin discloses a method (figure 1) of digital image compression 
comprising: 

identifying a plurality of areas of interest in the digital image (30); 

sampling the identified areas of interest at a higher spatial resolution than unidentified 
areas of the image (32: areas identified by ellipses are quantized with a higher spatial resolution); 

encoding the identified areas of interest at a first quality level for transmission to a 
decoder in one ore more additional data streams (first coder 32 produces a first data stream of 
finely quantized image data for the identified areas); and 

encoding the unidentified areas of the image at a second and lower quality level than the 
identified areas for transmission to the decoder in a separate data stream from that containing the 
identified areas (second coder 34 produces a second data stream (separate from the first data 
stream) of coarsely quantized image data for the unidentified areas; 

wherein said data stream containing said unidentified areas does not contain any 
information needed to recreate said identified areas of interest (as can be seen in figure 1, coding 
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of the unidentified areas is exclusive of the coding of the identified areas and does not contain 
information pertaining to the identified areas). 

Jacquin does not disclose that the areas of interest are identified by a group of viewers, 
and that the group of viewers comprises a statistically representative subset of an intended 
audience in order to predict areas of interest of the intended audience, however, the inclusion of 
these limitations would have been an obvious modification in view of Borah. See the explanation 
for claim 1 . 

Regarding claim 22, Jacquin discloses first and second coding methods, as claimed (32 
and 34, figure 1). 

Regarding claims 23 and 24, Borah's areas of interest are identified in real time during a 
live transmission and display of the image to a group of viewers. 

Regarding claim 26, Jacquin maintains a constant bit rate (column 10, lines 45-47: 
constant frame rate). 

Regarding claim 28, the areas of interest in both Jacquin and Borah are statistically 
recorded (e.g. their coordinates/dimensions are recorded). 

Regarding claim 32, Jacquin discloses a system for digital image compression (figure 1), 
comprising: 

means for identifying a plurality of areas of interest in at least a subset of digital images 
in a sequence of related images (column 3, lines 2-25: areas of interest in a current video frame 
are identified by the assignation of ellipses via the ellipse identifier 30); 
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an encoder for encoding the identified areas of interest at a first quality level and 
unidentified ares of the image at a second and lower quality level than the identified areas in 
order to produce a single compressed copy of each image which can be decoded at a decoder 
(column 3, lines 12-25 and first and second coders 32 and 34: areas of interest are coded with 
high quality; all other areas are coded with low quality; each image is then concatenated into a 
coded video signal which can be decoded). 

Jacquin does not expressly disclose a digital image display. 

Jacquin does not disclose that the areas of interest are identified by a group of viewers, 
and that the group of viewers comprises a statistically representative subset of an intended 
audience in order to predict areas of interest of the intended audience, however, the inclusion of 
these limitations would have been an obvious modification in view of Borah. See the explanation 
for claim 1 . 

Regarding claim 33, Jacquin discloses creating a quantization map based on the identified 
areas of interest (figure 1 : first coder 32 creates a quantization mapping for the areas of interest), 
wherein: 

the encoding is performed based on the quantization map (figure 1 : the "coded video 
signal" is based on the quantization map created by the first coder 32). 

Regarding claim 37, Borah discloses the claimed eye tracking mechanisms. 

Regarding claim 39, Jaquin discloses extrapolating areas of interest for a remainder of 
images in the sequence from the identified areas of interest in said subset (Borah, column 4, lines 
61-65: areas of interest are calculated for predetermined intervals of frames, and Jacquin, column 
7, lines 5 1-61 : the areas of interest (ellipses) of the current frame are used to predict, or create 
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extrapolated versions of, areas of interest in a successive frame; the predicted (extrapolated) 
areas of interest are then used to positively locate the actual areas of interest). 

Regarding claim 40, Jacquin' s sequence of related images is a digital motion picture. 

Regarding claim 41, Borah discloses a visual histogram to determine the most popular 
areas according to the location of fixation points (see figures 1, 2, and 6). 

Regarding claim 42, Borah's areas of interest are identified in real time during a live 
transmission of the image to a group of viewers. 

Regarding claim 43, Jacquin' s digital image spatially represents the image to be encoded. 

Regarding claim 46, Jacquin incorporates the use of DCTs (column 10, lines 9-13). 

Regarding claim 47, Jacquin discloses the quality level for blocks of pixels is adjusted for 
the areas of interest through the use of a quantization scale factor encoded for each block of 
pixels (figure 1: first coder 32 utilizes an adjusted "fine" quantization scale factor). 

Regarding claims 48 and 50, Jacquin discloses adjusting the quality level of the 
unidentified areas downward using one of the claimed methods (figure 1: second coder 34 
adjusts the quality level downward by quantizing the coefficients, thereby frequency- filtering the 
image). 

Regarding claim 51, Jacquin discloses a system (figure 1) of digital image compression 
comprising means for: 

identifying a plurality of areas of interest in the digital image (30); 

sampling the identified areas of interest at a higher spatial resolution than unidentified 
areas of the image (32: areas identified by ellipses are quantized with a higher spatial resolution); 
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encoding the identified areas of interest at a first quality level for transmission to a 
decoder in one ore more additional data streams (first coder 32 produces a first data stream of 
finely quantized image data for the identified areas); and 

encoding the unidentified areas of the image at a second and lower quality level than the 
identified areas for transmission to the decoder in a separate data stream from that containing the 
identified areas (second coder 34 produces a second data stream (separate from the first data 
stream) of coarsely quantized image data for the unidentified areas; 

wherein said data stream containing said unidentified areas does not contain any 
information needed to recreate said identified areas of interest (as can be seen in figure 1, coding 
of the unidentified areas is exclusive of the coding of the identified areas and does not contain 
information pertaining to the identified areas). 

Jacquin does not expressly disclose a digital image display. 

Jacquin does not disclose that the areas of interest are identified by a group of viewers, 
and that the group of viewers comprises a statistically representative subset of an intended 
audience in order to predict areas of interest of the intended audience, however, the inclusion of 
these limitations would have been an obvious modification in view of Borah. See the explanation 
for claim 1 . 

Regarding claim 53, Jacquin discloses first and second coding methods, as claimed (32 
and 34, figure 1). 

Regarding claims 54 and 55, Borah's areas of interest are identified in real time during a 
live transmission and display of the image to a group of viewers. 
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Regarding claim 57, Jacquin maintains a constant bit rate (column 10, lines 45-47: 
constant frame rate). 

Regarding claim 59, the areas of interest in both Jacquin and Borah are statistically 
recorded (e.g. their coordinates/dimensions are recorded). 

Regarding claims 63 and 64, Jacquin discloses a method and system for digital image 
compression, comprising: 

identifying a plurality of areas of interest in each digital image in a digital motion picture 
(column 3, lines 2-25: areas of interest in a current video frame are identified by the assignation 
of ellipses via the ellipse identifier 30); 

encoding the identified areas of interest of each image at a first quality level and 
unidentified areas of interest ata a second and lower quality level than the identified areas in 
order to produce a single compressed copy of each image which can be decoded at a standard 
decoder (column 3, lines 12-25 and first and second coders 32 and 34: areas of interest are coded 
with high quality; all other areas are coded with low quality; each image is then concatenated 
into a coded video signal which can be decoded). 

Jacquin does not disclose that the areas of interest are identified by tracking the eye gaze 
of a group of viewers using eye tracking mechanisms, and that the group of viewers comprises a 
statistically representative subset of an intended audience in order to predict areas of interest of 
the intended audience, however, the inclusion of these limitations would have been an obvious 
modification in view of Borah. See the explanation for claim 1. 



Application/Control Number: 09/821,104 
Art Unit: 2623 



Page 12 



4. Claims 25 and 56 are rejected under 35 U.S.C. 103(a) as being unpatentable over Jacquin 
in view of U.S. Patent 6,476,873 by Maeng. 

Regarding claims 25 and 56, Jacquin is silent to reducing the quality of the unidentified 
areas for security purposes. 

Maeng discloses a similar encoding system, wherein regions of interest within an image 
are encoded at higher quality levels than the unidentified areas. In, particular, Maeng teaches 
that, inter alia, reducing the quality of the unidentified areas is useful for remote security systems 
(column 2, lines 44-52). 

It would have been obvious to one of ordinary skill in the art at the time of the invention 
to modify Jacquin by Maeng to achieve the claimed invention since Maeng teaches that, inter 
alia, reducing the quality of unidentified areas is useful for security purposes. 

5. Claims 27 and 58 are rejected under 35 U.S.C. 103(a) as being unpatentable over Jacquin 
in view of Borah, and further in view of "Lossy/Lossless Region-of-Interest Coding Based on 
Set Partitioning in Hierarchical Trees" by Atsumi et al. ("Atsumi"). 

Regarding claims 27 and 58, Jacquin is silent to transmitting the higher quality areas first, 
followed by the lower quality areas. 

Atsumi teaches shifting the highest priority areas of interest (ROIs) to the beginning of 
the bitstream so that so that the areas of higher interest are transmitted before the areas of lower 
interest (see section 2.1). As a result, the image is built up starting with the areas of high interest. 
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It would have been obvious to one of ordinary skill in the art at the time of the invention 
to modify Jacquin by Atsumi to achieve the claimed invention since Atsumi teaches that 
transmitting and building up the highest priority areas first "enables the user to terminate 
transmission as soon as the ROI is reconstructed with a quality acceptable to the user, thus 
saving bandwidth (or time) and computational cost" (section 1, 1 st paragraph). 

6. Claims 14 and 45 are rejected under 35 U.S.C. 103(a) as being unpatentable over Jacquin 
in view Borah, and further in view of U.S. Patent 6,256,423 by Krishnamurthy et al. 
("Krishnamurthy"). 

Regarding claims 14 and 45, Jacquin is silent to providing a gradual transition as claimed. 

Krishnamurthy discloses a similar system for coding ROIs, wherein a transition region is 
provided to create a gradual transition between differently coded areas (see figure 1). 

It would have been obvious to one of ordinary skill in the art at the time of the invention 
to modify Jacquin and Borah by Krishnamurthy to achieve the claimed invention since 
Krishnamurthy teaches that a transition region should be included to avoid abrupt variations in 
quality (column 4, lines 60-64). 

7. Claims 7, 13, 38, and 44 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Jacquin in view of Borah, and further in view of U.S. Patent 6,144,772 by Garland et al. 
("Garland"). 

Regarding claims 7 and 38, Jacquin does not disclose the areas of interest are designated 
by a pointing device as claimed. 
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Garland discloses an image compression system similar to that of Jacquin wherein areas 
of interest are designated and encoded at different levels of quality. Whereas Jacquin teaches the 
automatic selection of areas of interest in a video frame, Borah and Garland teach that viewers 
may designate the areas of interest. Garland further teaches that areas of interest can be 
designated by users' manipulation of a pointing device (column 2, lines 11-15). 

It would have been obvious to one of ordinary skill in the art at the time of the invention 
to modify Jacquin and Borah by Garland to achieve the claimed invention since Garland 
discloses that areas of interest can be designated based on user preference rather than automatic 
selection in accordance with Borah, and Garland shows that using a pointing device to allow 
users to select areas of interest is a conventional technique. 

Regarding claims 13 and 44, Jacquin does not disclose assigning, to areas of interest, 
first values to areas of higher interest and second values to areas of lower interest; and 

encoding each area of interest at a quality level corresponding to the assigned value, as 
claimed. 

Garland discloses an image compression system similar to that of Jacquin wherein areas 
of interest are designated and encoded at different levels of quality. Whereas Jacquin designates 
that all areas of interest are to be encoded with "fine quantization" (figure 1, first coder 32), 
Garland discloses that the areas of interest within an image may have varying levels of 
importance, and therefore, the areas of interest are assigned different values based on the amount 
of interest in each area (column 6, line 6 through column 7, line 8). 

It would have been obvious to one of ordinary skill in the art at the time of the invention 
to modify Jacquin by Garland to assign different values to the areas of interest, based on relative 
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importance, and encode those values, since Garland shows that this feature provides more 
flexibility in that each of the areas of interest may be encoded according to different quality 
levels. 

8. Claims 18 and 49 are rejected under 35 U.S.C 103(a) as being unpatentable over Jacquin 
in view Borah, and further in view of U.S. Patent 5,896,176 by Das et al. ("Das"). 

Regarding claims 18 and 49, Jacquin utilizes the DCT transform but does not expressly 
disclose using a wavelet transform to encode the image. 

Das discloses an MPEG coding system, similar to that of Jacquin, wherein wavelet 
encoding is utilized in lieu of DCT coding (column 10, lines 40+). 

It would have been obvious to one of ordinary skill in the art at the time of the invention 
to modify Jacquin and Borah by Das to utilize wavelets for encoding, since Das teaches that, in 
the MPEG environment, wavelet transforms may advantageously replace the DCT transform for 
the purposes of image encoding. 

9. Claims 30, 31, 61, and 62 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Jacquin, in view of Borah, and further in view of U.S. Patent 6,356,664 by Dunn et al. ("Dunn"). 

Regarding claims 30 and 61, Jacquin is silent to enhancing the quality levels of 
unidentified areas to artificially create additional areas of interest to draw a viewer's attention, as 
claimed. 

Dunn discloses a system for priority-encoding regions of interest in video data, similar to 
that of Jacquin. In particular, Dunn discloses enhancing the quality levels of certain objects in an 
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image in order to draw a viewer's attention to the objects (figure 9 and column 8, line 61 through 
column 9, line 10). 

It would have been obvious to one of ordinary skill in the art at the time of the invention 
to modify Jacquin and Borah by Dunn to enhance certain areas of the image to draw attention to 
those areas, as claimed, since Dunn teaches that this feature is useful for advertising purposes. 

Regarding claims 31 and 62, Dunn discloses the areas of interest are products (column 8, 
lines 61-67). 



Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Colin M. LaRose whose telephone number is (703) 306-3489. If 
attempts to reach the examiner by telephone are unsuccessful, the examiner's supervisor, Amelia 
Au, can be reached on (703) 308-6604. The fax phone number for the organization where this 
application or proceeding is assigned is (703) 872-9306. 

Any inquiry of a general nature or relating to the status of this application or proceeding 
should be directed to the TC 2600 Customer Service Office whose telephone number is (703) 
306-0377. Art /I 
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